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ABSTRACT 


Reqression  analysis  based  on  agqreoative  data  is  sometimes  thought 
to  be  inferior  to  tlie  same  model   fitted  to  micro  data.     Tfie  latter, 
however,  is  often  unavailable.     In  this  paper  we  derive  bounds  for  the 
vanancos  of  the  nacro  paranetr-r  estimates  for  a    linear  regression  model 
with  random  coefficients  and  an  additive  error  term  and  show  that,  under 
plausible  conditions,  the  macro  pa -ameter  variances  will   diminish  as  the 
members  comprising  the  aggregate  increase,     it  is  also  shown  that  the 
Thei  I   aqnregation  weights   and  more  readi  Iv  obtainable  proportions    (to 
which  the  weights  are  usually  related)  must  decrease  in  size  as  the 
aggregate  population  is   increased  for  the  same  result  to  hold.     Under 
circumstances  that  often  exist  in  practice,  the  macro  estimates  will 
be  quite  efficient  ones. 
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1.1 


1 .  Introduction 

Let  6..  denote  the  vector  at  time  t  i,t=\  t'd,,..  ,T)   of  unobservable  para- 
meters of  the  micro  regression  structures  y^^  =  2i^^3_^^  +  e^^,  i  =  l,^,...,N, 
associated  with  N  individuals.  We  shall 
assume  that  these  micro  regression  parameters 

vary  with  time  according  to  a  wide-sense  stationary  stochastic  process  with 
E(lit)  ^  -  ^^^^   paper  considers  the  estimation  of  the  vector  6_  using  the 

N  N 

macro  data  Y.  =  i  y.,  and  X.  =  z  x... 
t   .^^  n    -t   .^^  -It 

This  type  of  problem  occurs  frequently  in  economics  and  other  social  sciences 
where  some  or  all  of  the  micro  data  (x.^.^,  y.^)  has  not  been  recorded  and 
only  the  macro  (aggregate)  data  is  available.  The  absence  of  micro  data  is  not 
the  only  case  when  we  might  want  to  consider  aggregation.  Given  the  micro  data,  one 
way  to  estimate  £  would  be  to  average  the  estimates  obtained  from  each  micro 
equation.  For  reasonable  N  this  involves  a  lot  of  computation,  and  perhaps 
an  estimate  based  on  the  aggregate  data  would  be  about  as  good  . 

In  an  earlier  oaper  (Kuh  (1972))  it  was  shown  under  certain  reasonable 
assumptions,  that  the  variances  of  the  estimated  macro  coefficients  decrease 
as  the  number  of  individuals  in  the  aggregate  increase.  Empirical  tests  of 
the  assumptions  supported  the  theoretical  propositions.  The  random  coeffi- 
cient assumption  used  there  differs  from  the  one  on  this  paner,  m  that 
randomness    was  conceived  to  be  cross-sectional  in  nature:  an  aggregate 
relation  may  include  heterogeneous  micro  oarameters  which  can  be  thought  of 
as  random  drawings  from  a  population  with  a  stable  mean.  Here,  randomness 
is  treated  as  a  time-series,  stationary  stochastic  process  where  each  micro- 
coefficient  vector  is  assumed  to  have  the  same  mean.  From  a  formal  point 
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of  view  the  sources  of  randomness  are  very  different,  but  in  both  cases, 
aggregation  gain,  defined  as  reduction  in  the  estimated  macro  variance  rela- 
tive to  the  micro  variances  can,  and  often  will  occur.  Suitable  aggregation, 
i.e.  aggregation  for  which  the  mean  population  parameters  (as  well  as  the 
random  process  laws)  behave  approximately  as  assumed,  thus  has  a  valid  justi- 
fication which  heretofore  has  been  absent.  Casual  agglomeration  of  data, 
however,  where  in  particular  the  population  micro  parameters  are  not  stable 
as  assumed,  will  evidently  cause  trouble.  A  subsequent  paper  using 

empirical  data  and  Monte  Carlo  experiments  will  explore  more  fully  the  impli- 
cations of  these  results  for  empirical  research,  some  of  which  have  already 
been  drawn  in  Kuh  (1972). 

The  results  presented  are  similar  to  those  of  Theil  (19(^8),  who  only 
treated  variability  in  the  pure  random  macro  population  parameter  case,  whereas  we 
treat  a  more  complete  model  as  well  as  estimation  properties.  Furthermore,  at 
the  end  of  this  naner,  the  relation  between  the  aqqreqation  weiqhts  of  Theil  (19b4)  and 
aggregation  is  shown.  Empirical  inferences  about  aggregation  weights  is 
available  (see  Kuh  (1972))  from  individual  shares  in  the  aggregate  that  cast 
some  light  on  the  question  of  aggregation  gain. 

The  next  three  sections  of  the  paper  present  the  basic  regression  model 
and  derive  bounds  for  the  macrovariances.  In  section  5,  the  limitinn  properties 
of  the  macrovariance  usinn  the  bounds  derived  earlier  show  v/hat  conditions 
must  hold  for  aggregation  gain  to  occur. 


2.1 
2.  The  Regression  Model 

For  the  micro  equations,  we  assume  the  existence  of  the  regression 
structures 

^'-'^       yn  =  ^t^t'^it  i=i,2,...,N 

t=l,2,...,T 

y..  is  the  dependent  variable, 

X..  is  a  1  X  K  vector  of  "explanatory"  variables, 

3^..  is  a  K  X  1  vector  of  unknown  regression  parameters, 

e.  is  the  additive  "error"  component. 


where 


and 


a.  E{..p  =  0 

b.  E(£..e.  )  =  a.&       6..   where  6  is  the  Kronecker  delta  and  the 

fX-JS      iSv   ij 

2  2    2 

o.  are  scalars  with  sup  o.   <  a   , 
^  i  .  ^ 

(2.2)    c.  B.,  is  the  realization  of  a  multivariate  wide-sense  stationary 

stochastic  process  with  £(6.^-^)  =  i  and  E(3.^.^-6_)(B.^-^^g-£) '  =  r^.(s; 

d.  The  processes  3.-^  and  b...  are  uncorrelated  for  i^j 

e.  B.  and  e .  are  uncorrelated  for  all  s  and  t  and  i  and  j. 

These  conditions  imply  that  we  have  N  uncorrelated  regression  equations 
where  the  parameters  in  each  equation  are  realizations  of  a  stochastic  process 
with  the  same  mean  vector.  For  each  i  this  model  is  identifical  to  that  pro- 
posed by  Burnett  and  Guthrie  (1970). 
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The  above  assumptions  require  comment.  (2.^a)  is  a  standard  assumption 
in  regression  analysis  that  we  retain  here.  (2.2b)  permits  the  additive 
error  variance  to  differ  among  individuals,  but  these  errors  are  postulated 
to  be  independent  across  micro  units.  To  the  extent  that  they  are  not, 
efficiency  gains  from  generalized  least  squares  estimation  which  demand  all 
micro  data  be  used,  will  be  foregone  [Swamy  (1970)].  (2.2c)  allows  for  what,  in 
principle,  could  be  a  conplex  autocorrelated  random  process  in  the  micro  co- 
efficients. Relaxation  of  the  assumption  that  the  population  micro  para- 
meters are  fixed  for  all  time  permits  a  substantial  increase  in 
realism.     Individual  firms  or  persons  often  behave  according  to  a  stable 
underlying  process,  but  that  behavior  often  departs  from  its  basic  (i.e. 
average)  modus  operandi  in  more  complicated  ways  than  are  represented  by  an 
additive  error  term.  (2. 2d)  asserts  that  random  coefficient  orocesses  across 
individuals  are  uncorrected.  We  have  somewhat  less  faith  in  the  validity 
of  this  assumption  than  the  others.   Oligopoly  interdependence  in  particular 
could  lead  to  violations  of  this  assumption.  In  neneral ,  however,  there  does  not 
appear  to  be  greater  departures  from  reality  in  this  instance  than  in  others 
made  in  the  estimation  of  economic  or  social  behavior  relationships.  Finally, 
(2.1e)  asserts  that  the  two  sources  of  randomness  are  uncorrelated,  a  proposi- 
tion that  is  convenient  and  does  not  appear  to  be  a  particular  cause  for  concern. 

In  summary,     the  random  coefficient  model  allows  for  much  richer 
behavioral  variations  that  should  be  considered  in  an  aggregation  context. 
Since  there  are  two  sources  of  random  variation  assumed  to  be  independent, 
results  from  the  following  analysis  hold  for  either  alone,  or  both,  so  the 
reader  can  choose  which  asoects  are  most  appealing  for  his  immediate  estima- 
tion concerns. 
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Let 


(2.3)  Y= 


and  assume  that  the  T  x  K  matrix  lis  of  full   rank.     We  propose  to  estimate 
3.  by 


(2.4)  b  =   (I'X)~^X'Y 


This  estimator  is  unbiased  because 


N  N  N 

(2.5)     E(Y  )  -     I     E(y.  )  =  Z  x   E(e.  )  +  z  E{t     ) 
t   ^-^i   n   .^^  n  -n   .^^   it 


=   E   X.,6  =  X 


i  =  l 


1 t-  -t- 


which  implies  that  E(Y)  =  Xp. 


3.1 


3.  Variance  Properties  of  the  Aggregate  Estimator 

For  each  N  we  can  compute  the  covariance  matrix  of  b^,  denoted  by  V|M(b^). 
We  are  interested  in  finding  conditions  so  that  the  elements  of  V|^(b^)  will 
remain  bounded  (or  go  to  zero)  as  N  increases.  Let  Z=^-  be  the  T  x  KT  matrix 


iiil 
0 


Ai2 


-i3 


^iT 


and  set 


r       ~i 

— 1 

^n 

s-1 

^i  = 

^i  = 

• 

iiT 

^•T 

L       J 

g=  u 

'lyh 

•  . 

Then 

N 
(3.1)      Y  =   Z   (z.3.  +  e. 
i=l  '^    ^  ^ 


and 


(3.2)     (b-6.)  =  fi  [  E  z.(3.-6.*)  +  £.] 


where  3*  is  the  KT  x  1  vector 
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It  is  now  convenient  to  define  the  KT  x  KT  matrix 


V.  = 


n^-(O)    r^.(l)    r^-(T-l) 


r.(T-l)    r.(T-2)    r,(0) 


which  represents  the  covariance  structure  of  the  stochastic  process  described 
in  (2.2c).  If  we  use  assumptions  b,  c,  d,  e  of  (2.2)  then 

N  "^2     1 

(3.3)     ]^(b)  =  E(b-B)(b-B)'  =  E  ^.  1^21  g'   +  .J:  o.{l'l)~\ 

Since  V^,(b^)   is  a  covariance  matrix,  the  Cauchy-Schwarz  inequality  implies 
that  in  order  to  make  estimates  about  the  magnitude  of  the  elements  of  lw(b) 
we  need  only  examine  the  diagonal   elements. 

Theorem  1 .     If 


(a) 

-Phnl^^ 

(3.4) 

1<<1<K 
l<t<T 

(b) 

l<p<TK 

then 
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(3.5)  [VN(b)],,  l(^T+a2)N(X!X)-} 


where  X  is  a  constant  independent  of  N. 


Proof.     Let   |c[^   ^|   denote  the  vector  whose  components  are  the  absolute  value 

of  the  components  of  2}^',  the  i^^  row  of  (L'D"^!' ,  and  1  denote  the  T  x  T 

matrix  with  each  component  equal   to  1.     Conditions   (a)  and  (b)  of  (3.4)   imply 

that 

(3.6)  sup       |(i..V.z:)_J  IX  <  - 

all   i       ^^^  P^ 

1<P£T 

l<q<T 


Now 


<3.7)      (i„(b))„  =  J^a'^'i,.  ii^Y'^'-J,  ^?(ri)-j 

and 

(3.8)  z    9.^%y.^^'^'^  =  tr    I    E,i,IJ.a'^^V^) 

i=l  '^^  i  =  l  ^^^ 


NA  trl|j'(^)l|^^^)| 


But  by  the  Cauchy-Schwarz  inequality 

(3.9)  trl|£'(^)|l^(^)|  =  il  |g>^|]^<T    I     (g/^V 

j=l     ^  j=l       ^ 

and 

(3.10)  l^  (g>')2=  Ia'"ll9'<*'l  =  (ri)-1. 
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The  inequality  (3.5)  follows   since  (2.2b)   implies  that 


E     o.    <   Ha    . 
i=l     ^  ~ 


Thus  we  have  shown  that  under  rather  plausible  conditions  we  can  examine 
V,^(b)  by  looking  at  N(X'X)"^  Condition  (3.4a)  merely  states  that  all 
elements  of  the  explanatory  variables  should  be  bounded  and  (3.4b)  imposes 
a  mild  restriction  on  the  covariance  structure  of  the  stochastic  processes 
generating  the  ^... 


4.1 

4.  The  Structure  of  (X'X)"^ 

From  the  above  discussion  it  is  possible  to  see  that  N(X'Xj~  plays  the 
crucial  role  in  determining  the  gains  that  might  be  obtained  from  aggregation. 
We  can  always  compute  N(K'X)'  but  it  is  useful  to  see  what  conditions  im- 
posed on  N{X'l)"^  imply  about  the  structure  of  the  micro  regression  equations. 

(a) 

Let  X^^  '  !!,=0,1 ,. . .  ,K-1  denbte  the  columns  of  X.  It  is  convenient  to 

assume  that  the  first  element  of  _x..  is  1  for  all  i  and  t.  Therefore  T  ' 

2 

is  a  column  vector  with  each  element  equal   to  N.     Finally  let  a^  be  the 

variance  of  _X^    '   (we  assume  a.   >  0,   £=1 ,. . .  ,K-1) ,  and  a         the  error  variance 

(o) 
for  the  regression  of  X^    '  on  the  remaining  K-2  explanatory  macrovariables. 


Theorem  2.     If  (3.4a)  holds,  then 


(4.1)  (I'X);}  =  -y-  ii=l,2,...,K-l 


and  T         1  m2i/2 

(4.2)  (i'DoJli-— ^T- 

NT       T       mm       a 
l<il<K-l      " 


Proof.     In  what  follows  p       stands  for  the  simole  correlation  between  macro 
explanatory  variables  X„   and  X     and  V„  denotes  the  coefficient  of  variation 
of  the  macro  variable  X.  . 
Fi  rst  we  note  that 


4,2 


(4.3)  [I'X]  = 


N^T 

Xo-^i  •  •  . 

X  '  •  X 

XJ>-X„ 

^■h .  ^ 

.  .  .  .  x^_i  x^_i 

^  -^- 



By  elementary  row  and  column  operations   [X'XJ  can  be  transformed  into  a 
partitioned  matrix  whose  lower  right  hand  partition  is  the  correlation 
matrix  [Cj. 


(4.4) 


[X'XJ  =   [£][£] 


where 


(4.5) 


[£][GJ 


1/V^        l/V^ 1/V^_^ 


'12 


'12 


'1,K-1      •    • 


VV^ 


1/V 


K-1 


[G] 


[£]  = 


NT 


i-x, 

T 

1-h 

0 

i-^-1 

0 

.    .   T 


'1,K-1 


•    1 


m 
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is  a  product  of  elementary  row  operations  that  reverse  the  subtraction  of 
means  from  second  moments  about  the  origin  and 


(4.6)     [G] 


'K-1 


is  used  for  elementary  row  and  column  operations  to  reverse  the  conversion 
of  centered  moments  into  correlations.  If  D_  =  C."  ,  then 


(4.7)     [x'xj"^  =  m'^ 


where 


T/N 

"1 

"2      • 

•    Vi 

0 

a 

0 

[GJ"^[FJ-^ 


(4.8) 


[£]" 


l/NT 
l-X. 


NT' 


I-^- 


NT' 
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and  a^  -  -  |^[y—  +^+  +y— —  ]. 


Since  the  matrix  [D]  is  now  the  inverse  of  a  correlation  matrix,  by 
a  well-known  formula   (Rao   (1965),  page  2?4) 


(4.9)  D„  =  !|  . 

From  this  we  can  evaluate  the  diagonal  elements  of  Cl'Xj"  by  reversing  our 
operations  with  the  elementary  matrices  [Fj  and  [Gj  to  establish  that, 


(4.10)    [^'xj;}  =  -|^  =  -^    £=1,2,...,K-1 

oj   To 
and  ^     ^•• 


(4.11)   [K'i]-j  =  -^-^(;i.;i.....;^) 


1  rl    ^n      ,   2  ,       ,  "K-1 


a-,    ar 
NT  h       W^  ^  V2        \.l 


A  typical  term  of  (4.11)  becomes 


"^  V    TN'^  ^1%   ^2^       ^-^^^i 
Now  using  the  fact  that  (X'X^)'  is  a  covariance  matrix  gives 


'"•"'    D.„    D..    IJ 


l  .U 


9  \i  \i   -      -  •  -V-  =  -kx'xj:'|x.x„  <  v{x'x]:l  /[x'xj'];  x.-5^„ 

N^T  ^\   ^j''^   TN^    ^=-J^J^-  '-=^30  L-=^M-j-e 


4.5 


and  condition  {3.^a)  implies  that 


(4.14)    _^<^M 


Jl=l,...,K-l 


Therefore 


(4.15)      Itn  \rl  1     "lax      -J-    - — — 
'^   ^         l<j<K-l      '        °£..°j 


and 

""      Nn    i<j£K-i    '     "£..°j 

1<£<K-1 


1     ^  M^K^  1 

'^   '  1<£<K-1      ^' 


which  completes  the  proof  of  Theorem  2. 
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5.  Limiting  Properties  of  Macrovariances 

The  previous  analysis  has  provided  bounds  for  V^(b_),  the  macro 
parameter  variance  in  terms  of  {^'X^)'  and  conditions  on  the  microvariables 
and  microparameter  variances,  when  the  macroparameters  are  defined  simply 
as  least  squares  estimates  based  on  the  macro  data.  We  now  discuss  under 
what  conditions  V^(b^)  tends  to  zero  as  the  number  of  elements  in  the 
aggregate  increases. 

It  is  clear  from  Theorem  1  that 


(5.1)    lim  V.,(b)  =  0 

[j->oo  ^^ 

if 


(5.2)     lim  N(X'X)"^  =  0 


and  conditions  (3.4a)  and  (3.4b)  are  satisfied. 
Theorem  3  implies  that  (5.2)  will  hold  if 


(5.3)  lim— 4— =0,     ^=0,1,2 K-1. 

2  2 

The  multiple  correlation  coffficient,  R    is  related  to  a     by  the  identity 

2 

2       "^i 
R    =1 2^  and  (5.3)  can  be  rewritten  as 

°£ 

(5.4)  lim   p  'S =0  )l=0,l,2,...,K-l 

N-Tofd-R^^J 


2 

In  order  for  (X^'X^)  to  be  invertible  we  must  have  R    <  1  for  all  a.     In 

most  applications  related  to  economic  data  it  is  reasonable  to  assume  that 
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a  6  >  0  exists  so  that 


(5.5)    sup  rJ   <  1  -  6 


If  (5.5)  holds  then  havinq 


(5.6)     Tim  -^  =  0 


is  enough  to  imply  (5.3). 

Define  the  average  variance  among  microvariables  as 


2   1  '^  1  T        -   2 


and  the  average  simple  correlation  amona  microvariables  as 

2      1  ^ 
(5.8)     r  =  TjTSTTT  ^  T  ^  ^ 


2    ,  1  I  ^^u-^£)^".^u-^u^ 


Then  it  follows  that 


(5.9)     J  =   — :. 

Taj   Tsni-(N-l)r] 

2 

In  any  cases  of  conceivable  interest,  we  would  expect  s  to  be  bounded  away 


from  zero.  In  many  economic  applications,  r  can  be  expected  to  be  positive 
and  bounded  away  from  zero.  (Since  Oj,  >  U  we  must  at  least  have  r  >_  l/(N-i; 
A  meaningful  industrial  aggregate  is  normally  composed  of  firms  with  common 
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production  methods  and  similar  customers.  While  in  the  short  run,  one  firm's 
gain  may  be  another's  loss,  fluctuations  in  market  demand  will  ordinarily  be 
shared  in  proportion  to  each  firm's  productive  capacity.  While  some  firms 
grov/  in  periods  of  declining  demand  and  others  fade  when  demand  is  arowino, 
this  "maverick"  behavior  is  unlikely  to  dominate.  In  many  cases  it  does 
appear  that  (N-l)r  will  increase  with  N.  Clearly,  however,  effectiveness 
from  the  point  of  view  of  agqreaation  gain  depends  on  the  strength  of  the 

average  correlation  among  entities  comprising  the  aggregate  (as  well  as 

2 
col  linearity  among  the  explanatory  variables  reflected  in  R   ).  One  might 

conjecture  that  the  larger  most  aggregates  become,  even  in  a  well  designed 

aggregation  procedure,  the  more  dissimilar  the  components  become,  thereby 

putting  definite  limits  on  the  amount  of  variance  reduction  that  can  in  fact 

be  achieved. 

We  have  examined  conditions  which  imply  that  V,,(b)  ->■   0  with  increasina 

N.  These  conditions  can  be  weakened  if  we  only  require  that  the  elements 

of  Vj^(b)  remain  bounded  as  N  becomes  large.  For  example,  because  of  the 

dissimilarities  introduced  as  the  number  of  components  increases  it  might 

be  that  (N-l)r  in  equation  (5.9)  is  bounded  (i.e.  r  is  not  bounded  away  from 

2  2 

zero).  In  tliis  case  N/Ta  would  be  bounded  (if  s  is  bounded  away  from  zero) 

and  of  course  V^(b)  would  not  necessarily  approach  zero.  Since  we  do  not 
observe  almost  zero  coefficient  variances  even  for  very  large  f^,  the  possi- 
bility raised  here  seems  to  be  occurring  in  practice. 
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6.  The  Theil  Weights 

Theil  (1954)  originally  showed  that  the  macro  parameters  can  be  related 
to  the  microparameters  through  a  set  of  aggregation  weights  which  are  con- 
structs obtained  through  regressing  each  micro  series  on  all  of  the  macro 
exogenous  variables,  the  resultino  regression  coefficients,  denoted  by  the 
column  vector  w|^^  being  the  weights.*  The  formal  relation  is 


(6.1)    w!^)  =  (X'X)-"'x'x  (^\ 


when  2ii         e.=0,l ,. . .  ,K-1   denote  the  columns  of  the  T  x  KT  matrix 


-iT 


It  follows  immediately  that 

N       /„^       0         for  p  =  £ 

Thus,  the  weights  for  corresponding  parameters  sum  to  unity,  and  sum  to  zero 
for  non-corresponding  parameters.  If  we  define  a  T  x  1  vector  of  residuals. 


(6.3)     u|^)  =  xj^)  -  xw_j^) 
then 


An  accessible  exposition  of  these  concepts  will  be  found  in  Klock  (1961) 
and  in  a  notation  similar  to  that  of  this  paper  in  Kuh  (1972). 


6.2 


(6.4)     xj^)  =  X   wW  +  uj^V 


It  is  now  possible  to  use  these  weights  to  relate  the  macro  and  micro 
equations.  A  few  definitions  are  necessary: 


^  =     [  ^j^^  u\ 


(0)  ,.(1)     ,.(K-1) 


(T  X  K) 


(U.)  =  t-th  row  of  U. 


t=l,2, 


Z  = 


E   X 

i=l 


i2 


,=1^" 


(T  X  KT) 


«.=[«(">,„('> w,*''-"] 


(K  X  K) 


"^1 

Q=- 

•  a 

M,  a- 

•  a  •• 

■^  a-- 

•  a 

a 

^1- 

•  a 

a    k^  • 

.  a  •• 

•a    w^- 

•   0 

0 

0 

^1 

0       0 

W 

0       0 

% 

(KT  X  KTN). 


Then  we  have 


6t3- 


(6,5)  Y  =  Ii 


^1 


^ 


and 


(6.6)  £ 


i=l 


[%),  i-i  +  .-t] 


.1^   l%)jl,j^^,jl 


T  X  1 


We  can  also  relate  these  weights  to  the  matrix   (X'X.) 


.v^-^ 


Theorem  3  .  If  (3.4a)  holds  then 


(6.7)     z  (wjjVlM^TNEd'D-h 
•j=l   'P  PP 


Proof.  The  proof  is  essentially  the  same  as  that  used  for  Theorem  1. 


(6.8)     '  (wi;))2=  '  >)x(^)x:(^V(p) 


i=l 


ip 


i=l 


=  tr;E  xi^^xi^v^^y^^ 


iNM^r  1|£'^P^||£^P^|  . 


The  remaining  steps  are  identical  to  (3.9)  and  (3.10)  in  the  proof  of 
Theorem  1 . 


6.4 


We  have  seen  in  Sections  3  and  5  how  agqreqation  gain  is  related  to 
'Ul'l)"^.  If  we  require  that  (5.2)  hold  then  Theorem  3  implies  that 


(6.9)     lim  I     {^\^J)^   =  0 
I^  i=l   ^P 


and  as  a  consequence 


(6.10)     limwi^^=0        ^=1....,K-1;  p=l,...,K-l 


l^ 


ip 


But  (6.2)  states  that 


(6.11)     I     wjj^  =  1 
i  =  l  ^^ 


for  all  n. 


ii  ) 
If  vje  view  the  diaqonal  weights  wh  '  as  proportions,  representing 


condition  for  aggregation  gain  is  that  no  micro  explanatory  factor  can  be  a 
large  proportion  of  the  aggregate  of  that  factor. 

The  earlier  paper  by  Kuh  (1972),  usina  data  for  four  industries, 
found  that  corresponding  aggregation  weights  frequently  resemble  proportions 
as  one  might  expect.  An  approximate  empirical  measure  of  aggregation  gain 
that  ignores  the  non-corresponding  weights  (which  typically  are  small)  can 
be  defined.  It  is  the  reciprocal  of  the  sum  of  squares  of  the  correspondina 
weights,  (or  alternatively  of  the  sum  of  squares  of  prooortional  shares) 
which  should  indicate  the  reduction  in  the  macro  parameter  variance  rela- 
tive to  the  averaoe  micro  parameter  variance.  The  empirical  results  showed 
that  aggregation  qain  usually  does  occur  and  that  the  predictive  value  of 


6.5 


the  measures  of  aggregation  gain  do  reasonably  well  although  not  uniformly 
so.  Further  theoretical  and  empirical  work  on  aggregation  benefits  and 
losses  in  terms  of  N(X'i)'^  will  be  pursued  in  subsequent  work. 
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